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Abstract 

The  field  of  expert  systems  has  claimed  numerous  and  successful 
applications  ranging  from  simple  tasks,  such  as  monitoring  a  valve  or 
fluid  rate,  to  very  complex  tasks  that  may  include  process  scheduling  or 
design  However,  to  construct  a  system  requires  the  developer  to  isolate 
a  set  of  rules  that  will  guide  the  decision  making  process.  Rules  are 
conventionally  defined  during  a  series  of  interviews  between  a  domain 
expert  and  a  knowledge  engineer  The  encoding  and  representation  of  the 
extracted  domain  knowL.Ige  has  proven  to  be  a  difficult  barrier  to 
overcome.  KASH  (Knowledge  Acquisition  SUell)  has  been  designed  to 
address  this  problem  by  providing  a  knowledge  engineer  with  a  set  of 
utilities  for  constructing  knowledge  acquisition  sessions  based  on 
interviewing  techniques.  The  information  elicited  from  domain  experts 
during  the  sessions  is  guided  hy  a  question  dependency  graph  fQfXr). 
The  QDG,  defined  hy  the  knowledge  engineer,  is  a  scries  of  control 
questions  about  the  domain  that  are  used  to  organize  (he  cognitive 
processes  of  an  expert.  The  content  information  supplied  by  the  expert, 
in  response  to  the  questions,  is  represented  in  the  form  of  a  concept 
map.  These  maps  can  be  constructed  in  a  top-down  or  bottom-up 
manner  by  the  QDG  and  used  by  KASH  to  generate  the  rules  for  a  large 
class  of  expert  system  dom.iins. 

Introduction 

Knowledge  acquisition  tools  use  a  variety  of  techniques  to 
elicit  from  a  domain  expert  the  essential  information  for  building 
an  expert  system.  The  application  domains  of  these  tools  can  be 
aligned  into  two  categories;  analysis  (e.g.,  diagnosis, 
identification,  interpretation)  and  synthesis  (e.g..  scheduling, 
planning,  design).  Analysis  is  a  top-down  process  providing  an 
examination  of  a  set  of  goals  that  are  decomposed  into  simpler 
and  more  basic  elements  and  relationships.  For  example,  FIX 
(Rodi,  Fierce  and  Dalton,  1989)  is  a  diagnostic  system  based  on 
class  attributes  for  representing  fault  detection  and  isolation. 
Synthesis  is  a  bottom-up  process  supporting  the  composition  or 
combination  of  given  facts  and  observations  that  are  used  to 
construct  a  set  of  goals.  For  example,  SALT  (Marcus,  1988)  is  a 
tool  based  on  a  propose-and-revise  method  for  designing 
elevator  configurations. 

The  above  classes  of  tools  work  well  within  their  intended 
domains,  but  are  not  easily  transferred  to  other  applications. 
Consequently,  the  time  and  effort  required  for  producing  the 
systems  must  be  reinvested  for  each  instance.  This  will  increase 
the  costs  to  the  customer  and  subsequently  decrease  reusability 
and  portability  of  the  tools  across  domain  applications.  To 
address  the  different  categories  (analysis  or  synthesis),  a  system 
must  be  able  to  work  within  the  constraints  set  by  each.  Expert 
system  shells  currently  provide  the  capacity  to  build  top-down  or 
bottom-up  applications  by  supplying  the  developer  with  a  finite 
collection  of  objects  to  represent  the  domains.  The  manner  in 
which  the  objects  are  defined  and  related  to  other  objects 
determines  the  degree  of  acceptability  of  the  system.  Therefore, 
if  the  focus  of  producing  a  knowledge  acquisition  system  is 
placed  on  the  structure,  rather  than  on  content,  a  more  diverse 
selection  of  applications  may  be  addressed. 


One  knowledge  acquisition  system  providing  a  domain- 
specific  structure,  KLAMShcll  (Cochran,  1988),  was  developed 
to  aid  in  the  construction  of  knowledge  bases  for  maintenance 
and  troubleshooting.  The  shell  elicits  information  via  subgoal 
satisfaction  in  a  depth  first  manner  to  retain  a  conicxt  focus  on 
the  knowledge  structure.  Each  goal  defined  is  decomposed  (via 
"push"  menus)  into  a  scries  of  suhgoals.  The  bottom-most 
nodes  arc  transformed  (via  "pop"  menus)  into  actions,  such  as 
questions  or  instructions,  to  be  used  in  the  final  system.  This 
process  continues  until  all  goals  have  been  specified  The  system 
is,  however,  domain  specific  and  the  generality  of  its  guidance  is 
limiled  to  only  a  small  subset  of  the  domain,  thereby  restricting 
its  use. 

Another  system  that  may  be  classified  as  a  general  purpose 
knowledge  acquisition  shell  is  PROTEGE  (Musen,  1989).  It  is  a 
tool  capable  of  generating  other  knowledge  acquisition  systems 
using  planning  entities,  task  level  actions,  and  input  data.  The 
methodologies  used  by  PROTEGE  separate  the  modeling  or  a 
domain  (i.e.,  control  structure)  from  the  application  knowledge 
(Musen.  1988),  thereby  customizing  each  system.  If oxvever 
PROTEGE  relics  on  a  set  of  fixed  templates  to  elicit  information 
and  this  will  limit  the  range  of  systems  that  can  be  produced. 

From  this  perspective,  KASH  (Knowledge  Acquisition 
SMcll)  has  been  defined  as  a  domain-independent,  knowledge 
elicitation  shell.  The  shell  provides  a  general  purpose  (reusable) 
environment  for  encoding  the  problem-solving  methods  of 
domain  experts.  A  set  of  three  independent  modules  (Figure  I) 
operate  within  the  top-down  (analysis)  and  bottom-up 
(synthesis)  constraints  of  the  domains.  These  modules  arc: 
Concept  Formulation ,  for  eliciting  knowledge  from  a  domain 
expert  and  structuring  it  into  a  concept  map;  Knowledge 
Analysis,  to  verify  the  concept  map  and  cross  check  any 
inconsistencies  found;  Rule  Generation,  to  produce  a  rule  set  for 
an  expert  system  based  on  the  concept  map.  A  question 
dependency  graph  (QDG)  supplies  the  control  structures  used  in 
the  concept  formulation  module.  Control  structures,  defined  a 
priori  by  knowledge  engineers,  are  necessary  to  obtain  the 
content  knowledge  from  the  experts.  The  utility  of  the  QDG 
allows  the  control  structures  to  be  defined  as  needed  and 
variations  of  the  graph  can  easily  address  new  applications. 

Concepts  and  Base  Facts 

The  basis  for  acquiring  knowledge  in  KASH  is  organizing 
the  cognitive  processes  an  expeit  many  formulate  about  a 
domain.  The  mechanisms  to  organize  the  processes  must  be 
tailored  to  the  idiosyncratic  representations  created  by  the 
experts.  However,  experts  tend  to  express  their  knowledge  in 
unstructured  and  nondeterministic  formats  and  difficulties  arise 
when  the  knowledge  must  be  converted  into  a  reliable  and 
useable  format  for  expert  system  development.  To  properly 
address  these  representation  problems,  expens  must  be  allowed, 
with  minimal  constraints,  to  describe  their  cognitive  processes 
with  respect  to  the  intended  application.  Typically,  the  expert 


w  ill  define  concepts  that  represent  personal  observations  in  the 
domain  to  be  modeled.  Concepts  arc  defined  as  symbols  (text  or 
image)  that  capture  the  meaning  or  intention  of  objects  and 
events  of  an  environment  (Ausubel,  Novak  and  lianesian, 
1978).  for  example,  an  electrical  system  is  an  object,  and  boil 
water  in  an  event.  In  essence,  a  concept  is  a  simplified  and 
gencraliicd  description  of  reality  for  a  particular  level  of 
abstraction  (Novak  and  Gowin.  1984). 

Concept  formulation  may  be  defined  as  the  process  of 
extracting  the  characteristic  features,  termed  critcrial  attributes, 
of  the  objects  or  events.  The  critcrial  attributes  arc  what  uniquely 
distinguish  each  concept  (e.g.,  size,  color,  shape).  A  set  of 
common  features  and  the  degree  to  which  the  features  are 
accepted  by  the  experts  will  determine  the  regularity  (i.e., 
meaning)  of  a  concept.  The  regularity  implies  precision  on  the 
definition  of  the  concept  so  that  misunderstandings  can  be 
minimized,  sec  (Novak  and  Gowin,  1984).  Critcrial  attributes 
also  allow  concepts  to  be  grouped  or  linked  together  to  form 
concept  maps  that  can  represent  the  conceptual  and  structural 
knowledge  of  the  domain  experts.  A  concept  map  is  simply  a 
hierarchical  taxonomy  of  concepts.  The  hierarchy  supports  the 
natural  subsumption  of  concepts  where  broad  concept 
definitions  are  depicted  at  the  top  of  the  map  and  more  detailed 
and  concise  concepts  at  the  bottom.  Concept  maps  promote  the 
reuse  of  concept  definitions  by  allowing  the  use  of  existing 
concepts  (e  g.,  door)  in  different  contexts  (e  g.,  house,  car). 
Such  a  virtual  feature  is  natural  for  multiexperl  integration 
because  it  provides  support  for  a  modular  structure  and  the 
development  of  a  library  facility  of  reusable  concept  definitions 
across  domains  where  applicable  (McGraw  and  Westphal, 
1988).  The  graphical  nature  of  concept  maps  also  allow 
multiexpcrt  conflict  to  be  made  explicit  thereby  excising  the 
knowledge  structure  of  faulty  linkages  and  misconceptions 
(Westphal  and  Reeker,  1990). 

The  concept  map  in  KASH  has  been  extended  to  include 
base  facts.  Base  facts  have  their  own  definition  because  they  are 
fully  instantiated  (e.g.,  parts,  ingredients,  symptoms)  and  do  not 
require  further  justification  for  their  existence.  Base  facts  are 
measurable  and  observab'e  in  the  context  of  the  concept  and 
support  the  criterial  attributes  of  a  domain.  The  criteria!  attributes 
are  necessary  for  distinguishing  between  concepts  and 
structuring  questions  into  categories  during  concept  formulation. 
The  critcrial  attributes  also  support  the  rule  generation  module 
because  they  are  variable  entities  (e.g.,  size  is  big,  color  is  red) 
that  can  be  compared,  contrast'd,  or  combined  with  other 
critcrial  attributes  to  lorm  the  rules  produced  by  K/\SH. 


Question  Dependency  Graph 

To  facilitaic  the  development  of  a  concept  map,  the  experts 
must  be  interviewed  using  questions  pertinent  to  the  content  and 
structure  of  the  problem  domain.  As  Novak  (1989)  stipulates, 
the  sequencing  of  questions  presented  to  the  expert  should  be 
tailored  or  grouped  into  specific  sets  of  knosvlcdgc  and  the  range 
of  these  sets  should  address  questions  at  broad  superordinate 
levels  and  become  more  narrow  and  precisely  defined  at  the  base 
level.  In  KASII,  the  questions  permissible  to  ask  of  the  domain 
experts  during  the  development  of  the  concept  map  are  specified 
by  the  knowledge  engineers  and  represented  in  a  question 
dependency  graph  (QDG).  The  QDG  has  been  abstractly  based 
on  the  six  types  of  questions  as  defined  by  LaFrance  (1987) 
where  each  question  type  decomposes  the  QDG  into  a  category 
of  queries  (broad  and  specific)  to  be  asked  with  respect  to 
conceptual  definition.  The  six  types  are: 

•  Grand  Tour ,  to  identify  domain  and  subdomain  boundaries, 
e.g.,  "What  is  the  purpose  of  this  system?" 

•  Cataloging  the  Categories,  to  support,  organize,  and 
structure  the  concepts,  e.g.,  "Can  the  concepts  be  ordered?" 

•  Ascertaining  tltc  Attributes,  to  specify  values  and  ranges  of 
the  criterial  attributes  for  concepts  and  base  facts,  e  g  . 
"What  values  can  attribute  assume?" 

•  Determining  the  Interconnects,  to  define  the  causal 
relationships  between  concepts,  e.g.,  "Docs  base  fact 
support  the  concept?’ 

•  Seeking  Advice  and  Cross  Checking,  to  compare  and 
contrast  information  in  the  concept  map.  e.g.,  "Does 
concept  contain  any  base  facts?”  or  "What  value  of  attribute 
is  out  of  range?" 

During  the  development  of  a  QDG.  a  knowledge  engineer 
must  specify  the  questions  that  will  be  presented  to  the  domain 
expert.  The  questions  arc  defined  to  separate  the  control 
knowledge  from  content  knowledge  of  the  domain  (analysis  or 
synthesis).  It  is  important  for  the  knowledge  engineer  to 
understand  the  basic  structure  of  the  domain  to  effectively  define 
questions,  categorize  the  questions  by  type,  and  determine  the 
control  sequence  of  the  questions.  Inefficient  specifications  at 
this  phase  can  lead  to  poor  interview  sessions  with  the  domain 
expert,  who  will  supply  the  content  know  ledge  (response  to  the 
questions).  In  the  QDG,  the  questions  are  represented  as  node 
structures  and  are  linked  to  other  node  structures  to  determine 
the  sequencing. 

.  de  structure  consists  of  a  question  frame,  question 
type,  'Tcri,on  guide,  and  a  variety  of  support  attributes.  The 
questi o-  ne  contains  the  actual  text  generated  during  a  query. 
Question  ;e  developed  by  the  knowledge  engineers  about  the 
control  structure  of  the  domain.  The  questions  derived  are  used 
to  respectively  refine  or  compose  the  concepts  in  an  analysis  or 
synthesis  domain.  The  six  different  question  types  defined 
above  are  used  to  insure  the  scope  of  the  questions  are  limited  to 
the  current  query  category.  By  doing  so,  the  focus  of  the  experts 
arc  concentrated  on  the  particular  task  level  being  modeled.  The 
selection  guide  is  satisfied  when  the  knowledge  engineer 
specifics  the  response  expected  from  the  question  frame  with 
respect  to  one  (or  several)  of  the  system  objects  specified  in 
KASH.  These  objects  are  a  prioritized  taxonomy  of  concepts, 
base  facts,  edges,  attributes,  and  standard  representation 
mechanisms  (i.e,  integers,  reals,  text,  etc)  that  help  to  resolve 
which  question  to  ask. 

The  edges  form  the  explicit  relationships  between  the  node 
structures  and  determine  which  tuples  of  questions  may  be 
presented  to  the  domain  experts.  A  tuple  of  two  questions  is 
interpreted  to  imply:  if  question  one  is  asked,  then  question  two 
may  be  asked.  There  can  be  manv-many  rclarinnships  between 
the  node  structures,  thus  forming  a  large  combination  of 
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questions  tuples.  The  edges  are  coupled  with  edge  weights  to 
determine  the  ordering  of  question  preference  -  a  form  of 
preliltering.  These  weights  may  be  displayed  as  a  range  of 
numbers  or  as  an  alternative  representation  (i.e.,  low,  med, 
high). 


ON.  Decompose  Concept 
QT.  Grand  Tour 


QF;  What  are  the  possible 
diagnoses  of  the  concept 
SG:  concept 


A  question  dependency  graph  based  on  a  diagnosis  process.  The  order  of 
node  letters  and  the  high,  medium,  or  low  priorities  indicates  direction. 
->  is  used  to  denote  explicit  control. 


Figure  2:  Question  Depeiulency  Graph 

Figure  2  shows  an  example  of  a  partial  question  dependency 
graph  for  a  circuit  fault  diagnosis.  Each  circle  represents  a 
unique  question  (defined  by  a  knowledge  engineer)  that  may  be 
asked  during  a  knowledge  acquisition  session.  The  box 
represents  the  node  structure  (only  one  is  detailed)  for  a 
particular  query  scheme.  The  question  name  is  Decompose 
Concept,  the  question  type  (QT)  is  Grand  Tour,  the  question 
frame  (QF)  is  defined  as  "What  are  the  possible  diagnoses  of  the 
concept?"  (variable  terms  in  the  question  will  get  instantiated 
during  execution),  the  selection  guide  (SG)  is  expecting  the 
response  of  the  domain  expert  to  be  a  concept,  and  a 
miscellaneous  collection  of  support  attributes  helps  to  specify 
display  formats  (DF)  and  the  control  strategy  (CS).  The  links 
between  the  nodes  are  labeled  with  high,  medium,  and  low 
priorities  to  indicate  the  control  paths  the  system  should  pursue. 
In  this  example  node  D  is  connected  to  nodes  E  and  F.  The 
heuristics  defined  in  the  concept  formulation  module  will  select 
the  better  candidate  question  (E  or  F)  to  ask  based  on  the  state  of 
the  concept  map. 

A  QDG  editor  is  used  for  the  construction  of  the  question 
graphs.  The  editor  allows  the  knowledge  engineers  lo  construct 
specialized  question  bases  through  QDG  link  specifications. 
New  questions  may  be  introduced  into  the  graph  at  any  time. 
The  set  of  base  questions  is  always  available,  however  it  is  the 
format  in  which  they  are  connected  that  will  give  a  system  its 
functionality.  This  combination  of  nodes  and  links  allow  KASH 
to  be  used  across  a  large  range  of  applications  and  proves  to  be 
very  versatile  and  powerful  throughout  the  knowledge  modeling 
process.  As  the  QDG  changes,  so  will  the  system,  because  the 
QDG  is  the  canonical  guidance  referenced  throughout  the  three 
modules  of  KASH:  concept  formulation,  knowledge  analysis, 
and  rule  generation. 

Concept  Formulation 

Concept  formulation,  necessary  to  acquire  the  knowledge 
structure  from  a  domain  expert,  is  interleaved  between  a  top- 
down  and  bottom-up  elicitation  strategy  guided  by  the  QDG. 
The  top  down  refinements  are  focused  on  acquiring  the  abstract 
concepts  that  are  used  to  deFtne  the  components  of  a  domain. 
These  components  are  in  turn  supported  by  bottom-up  facts  that 
icpresent  the  logical  entities  or  extension  of  the  concepts.  The 
use  of  top-down  and  bottom-up  strategies  in  concept  formulation 
reflect  the  structures  that  exists  in  the  different  levels  of 


knowledge  and  the  control  nature  of  the  analysis  and  synthesis 
classes  of  problems1.  The  analysis  problems  involve  identifying 
sets  on  objects  based  on  their  features.  Synthesis  problems 
construct  a  solution  from  component  pieces  or  subproblem 
solutions.  Figure  3  is  a  section  of  (he  top-down  (and  bottom-up) 
process  trace  of  a  concept  formulation  based  on  the  QDG  for  a 
circuit  fault  diagnosis.  The  index  (A-M)  for  each  instantiated 
question  text  shown  (i.e.,  variables  are  bound  within  the  context 
of  the  question)  corresponds  to  a  node  in  the  QDG  shown  in 
Figure  2. 


A)Define  ihc  terminology: 

ll)What  are  some  attributes 

Top-down  icrms(s)  :>  diagnosis. 

of  sensor  mounts? 

Doltom-up  tcrms(s)  :>  symptoms. 

:>  connectors. 

B)What  is  the  purpose  of 

l)Whal  valur(s)  can  connectors 

this  diagnosis? 

of  sensor  mounts  assume? 

:>circuit  fault. 

:>  loose,  broken,  grounded,  working. 

C/Define  any  symptoms  of 

i)ls  connectors  used  in 

a  circuit  fault. 

sensor  grounding? 

:>manifold  threads. 

:>  yes 

DJVVhat  are  the  possible  diagnoses 

K)What  valuc(s)  of  connectors  would 

of  a  circuit  fault? 

make  sensor  grounding  succeed7 

:>  control  circuit  failure 
:>  sensor  fault. 

:>  grounded,  working. 

L)\Vhat  valuc(s)  of  connectors  would 

E)Can  these  be  ordered? 

make  sensor  grounding  fail? 

:>  no. 

:>  loose,  broken. 

F)ls  manifold  threads  a  symptom  of 

M)Is  tins  a  defjull  or  infmr1  value? 

control  circuit  failure,  sensor  fault? 
:>  sensor  fault. 

:>  inferred 

N)Can  connectors  be  related 

G)Are  tliere  any  oilier  symptoms 

lo  other  aunbules? 

of  sensor  fault? 

:>  sensor  mounts. 

:>  no 

H)NVhal  are  some  attributes  of 

D)What  are  the  possible  diagnoses 

manifold  threads? 

of  sensor  fault? 

:>  broken  sensor. 

:>  connectors. 

:>  sensor  grounding. 

l)What  value(s)  can  connectors 
of  manifold  threads  assume? 

E)Can  these  be  ordered? 

:>  no. 

:>  dirty,  corcoded.  clean,  working. 

J)ls  connectors  used  in 

F)ls  manifold  threads  a  symptom  of 

sensor  grounding? 

sensor  grounding,  broken  sensor? 

:>  sensor  grounding. 

:>  yes 

K)Whal  va1ue(s)of  connectors  would 

f*)Is  sensor  mounts  a  symptom  of 

make  sensor  grounding  succeed? 

sensor  grounding,  broken  sensor? 

:>  sensor  grounding. 

:>  clean,  working. 

L)What  value(s)  of  connectors  would 

G)Are  there  any  other  symptoms 

make  sensor  grounding  fail? 

of  sensor  grounding? 

:>  voltage  test. 

:>  dirty,  corroded. 

M)ls  this  a  default  or  inferred  value? 

D)Whai  are  the  possible  diagnoses 
of  sensor  grounding? 

:>  inferred. 

:>  none. 

N)Can  connectors  be  related  to 
other  attributes? 

D)What  are  the  possible  diagnoses 

:>  no 

of  broken  sensor? 

• 

:>  none. 

• 

• 

Figure  3:  Question  Trace  from  QDG 


The  concept  formulation  module  will  iteratively  execute  six 
stages  during  the  construction  of  the  concept  map.  These  stages 
arc  responsible  for  selecting  a  concept  from  the  graph, 
establishing  base  facts  for  the  concept,  formulating  a  question  to 
apply  to  the  concept,  generating  the  question,  then  presenting 
and  accepting  the  results  from  the  expert. 


' X usubel  (1978)  addresses  these  abstrar-.ons  in  his  propositional  learning 
theory. 
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circuit  fault 


manifold  sensor  voltage 

threads  mounts  test 


Heuristic  Criteria  for  Concept  Selection  -  A  concept 
is  selected  from  the  set  of  open  concepts.  This  set  represents 
those  concepts  that  have  not  been  fully  explored  by  the  QI>G 
such  that  there  arc  additional  questions  that  may  be  asked 
about  them.  The  concept  is  selected  based  on  several  factors 
including  the  order  of  importance  that  is  explicitly  specified 
by  the  user,  a  system  defined  depth  first  or  breadth  first 
selection  mode,  the  recency  of  a  concept  definition,  the  level 
ol  attributes  defined,  the  number  of  established  links  to  the 
base  facts,  the  number  of  derived  (children)  concepts,  or  a 
concept  of  enumerated  type. 

Establish  Base  Facts  -  The  first  step  taken  after  the 
concept  has  been  selected  is  to  elicit  the  base  facts  that 
support  the  concept.  These  base  facts  define  the  fundamental 
units  of  the  domain.  The  new  base  facts  are  specified  by  the 
user  and  supportive  links  arc  established  to  the  selected 
concept.  User  selected  base  facts  arc  then  passed  down  to 
the  selected  concept  from  a  parent  concept.  Note,  parent 
concepts  must  resolve  all  supportive  links  before  the  concept 
map  is  considered  complete. 

Select  Questions  -  The  list  of  applicable  questions  for  the 
current  concept  is  calculated  by  obtaining  all  the  edges 
attached  to  the  previous  question  in  the  QDG  and  placing 
them  into  a  list.  A  duplication  filter  is  applied  to  the  list  to 
resolve  any  conflicts  found  in  a  list  of  previously  ask 
questions  maintained  as  a  concatenation  of  the  concept  name 
and  the  question  identifier.  The  list  is  then  processed  through 
some  ordering  filters  that  act  on  the  edge  weights,  selection 
guide  matching,  and  groups  division  types.  The  product  of 
these  filters  results  in  a  refined  list  of  questions  to  be  asked. 

Generate  Question  -  The  product  of  the  generate 
question  stage  is  the  selection  of  a  single  query  node  made 
from  the  refined  list.  This  list  of  applicable  questions  for  the 
current  concept  is  acted  on  by  a  function  of  the  question 
types  and  other  system  factors  (not  detailed  here). 

Question  Presentation  -  After  a  single  question  has  been 
selected  it  is  presented  to  the  user  to  solicit  an  answer.  The 
control  panel  is  consulted  for  the  appropriate  terms  (top 
down/bottom  up),  display  formats  (bar  graphs,  text,  pie 
chart,  menu),  and  concept  graph  information  (attributes, 
types,  links). 

Answer  Questions  -  The  expert  responds  to  the  question 
through  either  text  (e.g.,  attributes,  concepts,  base  facts, 
names,  or  values)  or  concept  graph  management  facilities 
(e.g,  graphics,  node  selection). 

The  output  expected  from  the  concept  formulation  module  is 
a  concept  map  representing  a  top-down  or  bottom-up  process 
model.  The  concept  map  represents  the  observations  of  domain 
experts  as  guided  by  the  control  structure  specified  in  the  QDG. 
Figure  4  shows  the  concept  map  for  (he  circuit  fault  example.  If 
the  execution  of  the  module  has  not  fully  defined  the  range  of  the 
expert's  knowledge,  the  QDG  may  be  reconfigured  to 
accommodate  the  missing  information.  From  this  point  the 
information  gleaned  from  the  expert  in  the  form  of  concept 
modules  and  base  facts  is  verified  by  both  the  system  and  the 
domain  expert.  Several  structure  analysis  techniques,  described 
below,  have  been  defined  to  accommodate  this  task. 

Knowledge  Analysts 

The  knowledge  analysis  module  is  a  hybrid  of  several 
interactive  subsystems  that  are  used  to  validate  and  refine  the 
concept  map.  During  validation,  the  knowledge  acquired  from 
the  concept  formulation  module  is  analyzed  to  bring  out 
inconsistent,  incomplete,  and  unjustified  information  that  may 
occur  in  particularly  large  systems  or  from  the  use  of  multiple 
experts.  In  refining  the  concept  map,  the  system  will  look  for 
ways  to  enhance  the  structure  of  the  model  through  the 
application  ot  a  variety  of  techniques.  Several  proposed 
techniques  are  described  below,  also  see  Figure  5. 


Figure  4:  Concept  Mop  for  Circuit  Fault 

(.luster  Analysis  -  Cluster  analysis  detects  cases  in 
which  large  numbers  of  concepts  are  derived  from  (or 
clustered  around)  a  more  general  concept.  It  suggests  that  the 
general  concept  is  too  broad  and  it  should  imply  several  new 
concepts  where  the  other  concepts  can  be  derived  from,  or 
sonic  of  the  derived  concepts  should  be  promoted  to  the  the 
intermediate  level  concepts  between  the  general  concept  and 
Others.  The  introduction  of  intermediate  concepts  helps 
structure  the  knowledge  elicitation  stage  and  reduces  the 
complexity  of  the  acquired  knowledge. 

Enlailntcnl  Analysis  -  The  basis  for  entailment  analysis 
stems  from  the  attributes  associated  with  the  concepts.  The 
technique  makes  the  entailmcnts  between  concepts  obvious. 
There  will  be  cases  in  which  the  entailmcnts  indicate  that  a 
concept  can  be  subsumed  by  another  concept  because  it  is 
derived  by  another  concept  that  has  no  other  derivation. 

Similarity  Analysis  -  The  knowledge  engineer  submits 
to  the  system  a  number  of  known  cases  for  analysis.  Each  of 
these  cases  is  represented  in  the  format  compatible  with  the 
acquired  knowledge.  The  system  will  try  (o  justify  the  given 
facts  and  results  of  the  cases.  If  the  facts  and  results  cannot 
be  explained,  the  system  will  explain  the  failure  and  it  will 
seek  help  from  the  user  by  asking  questions  from  the  QDG 
focusing  on  the  unexplained  concepts  and  the  failure. 

Relative  Analysis  -  Relative  analysis  is  based  on  the 
reuse  of  concept  modules  in  the  graph.  A  concept  associated 
with  a  base  fact  is  selected  for  review.  The  analysis 
technique  will  propagate  up  the  structure  to  a  level  outside 
the  immediate  Hierarchy  associated  with  the  concept  and 
query  the  expert,  via  the  QDG,  if  it  can  be  used  in  the 
adjacent  paths.  This  type  of  analysis  supports  the  use  of 
virtual  structures  and  serves  as  a  form  of  memory  cue 
entailment  for  additional  structure  refinements. 

Subcomponent  Analysis  •  A  base  fact  is  selected  from 
an  active  path  in  the  concept  structure.  This  path  consists  of 
the  base  fact  and  all  the  concepts  that  comprise  the  p3th  to  the 
root  node.  The  base  fact  is  compared  to  each  the  concepts 
that  subscribe  to  the  path  and  questioned  by  the  QDG  for 
relevancy  against  the  concept.  Negative  responses  will 
indicate  inconsistencies  in  the  structure  and  will  be  resolved 
through  further  QDG  examinations. 
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f'nnreptnnl  Analysis  •  Attempts  to  enforce  the  ptincip.il 
tint  every  hive  fact  must  he  nli.vhed  to  a  concept  tint  is  not 
(milter  decomposed  hv  adtlilionil  concepts  If  any  such  case 
tv  encountered  it  cin  he  assumed  that  there  arc  additional 
conceptual  levels  that  have  not  been  defined.  Additionally,  if 
a  concept  has  been  decomposed  into  another  single  concept, 
then  the  linear  formation  will  invoke  the  analysis  to  collapse 
both  in'n  a  unary  concept  structure. 

Knowledge  analysis  can  he  interactive  or  selectahle.  In  the 
lust  mode,  interactive ,  a  number  of  knowledge  analysis 
techniques  are  selected  from  a  menu  hefore  the  concept 
formulation  module  is  executed.  As  knowledge  is  acquired  these 
techniques  arc  automatically  invoked  to  yield  analysis  results. 
I  lie  results  arc  shown  in  a  side  window  with  the  appropiiale 
marks  to  indicate  the  different  degrees  of  importance.  In  the 
second  mode,  t elenian,  the  knowlcdpc  analysis  slapc  is  entered 
after  a  knowlcdpc  elicitation  session,  'the  manner  in  which  the 
analysis  results  arc  shown  is  the  same  as  in  the  previous  mode, 
hut  more  thorouph  analysis  techniques  can  he  applied, 
particularly  the  similarity  analysis.  Hie  topics  of  refinement  and 
analyses  to  be  performed  are  based  on  a  prioritized  scheme  and 
will  execute  after  any  data  chanpcs  or  alterations  to  the  concept 
structure  arc  made. 


Rule  nenernfion 

The  third  module  of  KAMI  is  used  to  support  the  final 
process  of  knowledge  acquisition,  rule  generation.  This  module 
creates  a  set  of  rules  that  can  be  integrated  into  a  third  party 
expert  system  shell.  The  three  major  factors  that  influence  the 
generation  of  the  rules  are  the  conrept  map,  rule  generation 
strategy,  and  target  expert  system  shell. 


I  irst.  the  concept  map  is  extended  to  include  any  terms 
(ctitcrial  attributes)  to  he  user!  in  the  production  of  the  rules. 
New  attrihiilcs  .r-c  created  and  manipulated  by 
iiirnhntr<  -  xtttrihutr,  littr ihitrc  ■_  -  V n irn  f'j'!,  and  attnl'utc<  ■  >  value 
questioning  Schemes  defined  in  the  t.'IINi  T  he  information 
provided  by  the  cx|vits  in  response  to  the  questions  specifies  the 
am  ibules  of  a  concept,  the  relationships  of  the  attributes  lo  other 
attributes,  and  the  range  of  values  the  attributes  rn.iv  assume 
Attribute  definitions  are  initialed  for  each  of  the  base  lacls  in  the 
concept  map  The  definitions  are  propagated  to  the  concepts  via 
an  inverted  inheritance  scheme  with  question  filters  to  terminate 
any  attribute  stream.  The  attribute  relationships  in  the  concepts 
represent  rules.  The  rules  arc  local  lo  the  concept  Tor  w  hich  thev 
arc  defined  atirl  may  be  classified  as  either  tlcfnilr,  w  here  new 
logical  relationships  between  attributes  are  specified  (i  c.  set  the 
temperature  of  material  lo  equal  the  temperature  of  the  gas),  or 
inferred  where  the  system  must  calculate  the  value  of  the  used 
attribute  (i.c  .  the  temperature  of  I  lie  material  must  equal  the 
temperature  of  the  gas)  The  inferred  values  arc  present  in  the 
rule  premises  and  the  default  values  typically  appear  in  the 
consequents. 

Second,  in  the  rule  generation  strategy  the  local  rules  arc 
used  hv  a  series  of  high  level  lop  dow  n  and  bottom-lip  strategics 
that  determine  the  direction  in  which  final  rules  arc  developed. 
-  flic  top  down  strategy  employed  in  gcnciating  rules  is  based  on 
a  concept  and  the  links  lo  its  supportive  conerpts  The  bottom- 
up  strategy  directs  the  rule  generation  based  on  the  lower  level 
concepts  lo  the  higher  level  concepts  that  they  support.  I  Ins 
strategy  supports  both  forward  and  backward  chaining  rule 
specifications  for  integration  into  the  expert  system 

Third,  the  characteristic  or  mivfcl  of  the  target  expert  system 
shell  language  influences  the  different  ways  in  which  an  optimal 
rule  set  may  he  generated.  This  depends  on  the  different 
representation  schemes,  control  strategics,  dramons.  triggers, 
etc.  encountered  in  the  different  expert  system  shells,  for 
example,  instead  of  explicitly  stating  separate  rules  as  would  he 
necessary  in  a  pure  production  system  such  as  OrS-5.  the 
number  of  rules  generated  can  be  reduced  in  an  object  otiented 
system  such  as  NIIXPRRT  or  Klill  due  to  the  abstraction  of 
frame  structures. 

f  igure  6  shows  the  partial  set  of  tides  that  have  been  created 
for  the  circuit  fault  diagnosis  example.  Heforc  these  rules  arc 
convened  into  a  target  expert  system  format  they  must  he 
reviewed  by  a  scries  of  analysis  techniques  to  ensure  there  arc 
no  redundant,  conflicting,  or  circular  rules.  The  final  ('income  is 
a  knowledge  base  of  objccls,  relationships,  and  rules  that  may 
be  used  in  a  variety  or  expert  system  applications. 

Supportive  Modules 

There  exists  a  number  of  external  features  that  will  be  added 
to  fhc  basic  functionality  of  KASII.  These  features  are  intended 
to  extend  suppoit  to  the  knowledge  acquisition  process  and 
supplement  the  work  of  the  QDO.  The  following  represent  a 
subset  of  the  planned  features  to  be  developed: 

•  The  lime  issues  associated  with  a  domain  model  will  he 
addressed  through  a  temporal  subsystem  The  concept 
graphs  will  be  modified  to  accommodate  the  values 
necessary  to  represent  the  start,  end.  and  latency  periods. 
The  question  dependency  graph  will  also  he  updated  to  elicit 
the  necessary  information.  The  temporal  information  will  he 
represented  through  a  time  bnx  consisting  of  a  series  of 
levels  each  partitioned  into  the  respective  intervals  specified 
during  the  temporal  questioning.  This  time  box  can  be 
viewed  from  varying  levels  of  detail  depending  on  the 
concepl  graph  depth. 

•  An  enhanced  control  panel  will  contain  a  multitude  of  user 
defined  parameters  to  coordinate  the  elicitation  sessions  The 
panel  will  contain  the  terms  used  for  the  top-down  and 
bottom-up  definitions.  Trc-logical  settings  for  which  top- 
down  entries  arc  best  suited  for  the  botlom-up  selections  will 


be  available.  The  search  strategics  (i.e.,  depth  first,  breadth 
first)  can  be  set  from  the  panel.  A  question  list  is  defined  in 
the  panel  and  used  to  disable  groups  of  questions  for  the 
session  (usually  low  priority  settings).  Additionally,  the 
analysis  techniques  can  be  made  interactive  or  selectable 
from  the  panel  settings. 

•  A  history  window  will  supply  a  meta-command 
interpretation  of  the  events  occurring  in  the  system.  This  will 
provide  documentational  support  to  analyze  the  elicitation 
session  as  it  progresses  over  time.  The  log  will  be  used  to 
archive  the  state  of  the  concept  model  for  easy  reconstruction 
at  a  future  date.  Additionally  the  history  list  will  be 
constructed  for  path  resolution  of  the  virtual  system 
hierarchy. 

•  A  glossary  of  terms  defined  by  the  expert  during  the 
elicitation  of  concepts  and  facts  will  be  supported  as  a 
scrolling  scries  of  menus.  This  facility  can  support  the 
comments  associated  with  multiple  expert  development.  The 
glossary  support  system  can  he  expected  to  emulate  a  hyper- 
expert  notation  scheme  where  the  terms  defined  can  be  a 
mixture  of  text  and  graphics 


IF  level_of_voltagejest  <  3 

THEN  position_of_sensor_grounding  -  "open" 

IF  level_of_voltage_test  >«  3 
AND  !evel_of_voltage_test  <  5 
THEN  posilion_of_sensor_grounding  -  "closed" 

IF  connectors_ol_sensor_mounts  «  "loose" 

THEN  cond_of_sensor_grounding  -  "poor" 

AND  recommenda!ion_of_sensor_grounding  -  "tighten" 

IF  connectors_o(_sonsor_mounts  -  "broken" 

THEN  cond_o(_sensor_groundinq  -  "poor" 

AND  recommendation_o(_sensor_grounding  «  "replace" 

IF  connectors_o(_mani(old_lhreads  -  "dirty" 

THEN  cond_of_sensor_grounding  -  "poor" 

AND  recommendation_of_sensor_grounding  -  "clean" 

IF  connectors_o(_maniloldjhreads  -  "corroded" 

THEN  cond_ol_sensor_grounding  -  "poor" 

AND  recommendation_of_sensor_grounding  -  "replace" 

IF  cond_of_sensor_grounding  »  "poor" 

AND  position_ol__sensor_grounding  *  "open" 

THEN  status_ol_sensor_faul!  »  "true" 

AND  recommendation_of_sensor_grounding  ■ 
recommendation_of_sensor__fault 
AND  action_ol_sensor_fault  -  "shutdown" 

IF  cond_o!_sensor_grounding  -  "poor" 

AND  posilion_ol_sensor_grounding  .  "closed* 

THEN  status_ot_sensor_fault  «  "true" 

AND  recommendation_ol_sensor_grounding  • 
recommondation_of_sensor_faull 
AND  aclion_ol_sensor_fault  -  "maintenance" 

IF  status_of_sensor_laull  -  "true" 

THEN  status_of_circuit_faul!  -  "irue* 

AND  recommendation_o!_circult_fault  ■ 
recommendation_of_sensor_(au!t 
AND  aclion_of_circuil_laull  • 
action_of_sensor_faull 


Figure  6:  KASII  Circuit  Fault  Rule  Set 


Conclusion 

KASI I  is  a  general  purpose  knowledge  acquisition  shell  that 
acquires  information  about  a  domain  from  an  expert.  Because 
KASII  has  been  designed  to  support  both  analysis  and  synthesis 
applications,  it  may  be  applied  to  a  broad  range  of  systems 
including  planning,  design,  classification,  scheduling,  decision 
support,  and  diagnosis  where  complex  knowledge  is  often 
acquired  from  multiple  domain  experts.  The  three  modules 
supplied  by  KASH  provide  the  capability  to  acquire  knowledge 
in  context  (i.e.,  customized  for  each  domain):  Concept 
Formulation,  for  structuring  and  eliciting  the  knowledge  from  a 
domain  expert  into  a  concept  map:  Knowledge  Analysis,  to 
validate  the  concept  map;  and  Rule  Generation,  to  produce  a  rule 
set  for  an  expert  system  based  on  the  concept  map.  All  the 
information  elicited  from  a  domain  expert  is  guided  by  a 
question  dependency  graph  (QDG).  The  QDG  is  developed  by  a 
knowledge  engineer  to  separate  the  control  knowledge  from  the 
application  knowledge.  Thus,  the  QDG  is  reconfigurablc  for 
customerization  across  applications  and  domain  experts. 
Furthermore,  KASII  has  the  capabilities  for  producing  a 
knowledge  base  that  can  be  used  to  generate  alternative  rule  sets 
for  different  expert  system  shells. 
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